Overview

Dataset statistics

Number of variables15
Number of observations2775
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory325.3 KiB
Average record size in memory120.0 B

Variable types

Numeric15

Warnings

gross_revenue is highly correlated with qtde_itemsHigh correlation
qtde_items is highly correlated with gross_revenueHigh correlation
avg_ticket is highly skewed (γ1 = 27.68809767) Skewed
frequency_purchase is highly skewed (γ1 = 47.42232667) Skewed
df_index has unique values Unique
recency_days has 33 (1.2%) zeros Zeros
returns has 1483 (53.4%) zeros Zeros

Reproduction

Analysis started2021-05-25 14:12:44.340729
Analysis finished2021-05-25 14:13:14.086198
Duration29.75 seconds
Software versionpandas-profiling v2.13.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct2775
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2248.812252
Minimum0
Maximum5694
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:14.224701image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile181.4
Q1895.5
median2055
Q33408.5
95-th percentile4956.3
Maximum5694
Range5694
Interquartile range (IQR)2513

Descriptive statistics

Standard deviation1526.103739
Coefficient of variation (CV)0.6786265673
Kurtosis-0.9555046207
Mean2248.812252
Median Absolute Deviation (MAD)1238
Skewness0.3811143644
Sum6240454
Variance2328992.623
MonotonicityStrictly increasing
2021-05-25T11:13:14.372531image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01
 
< 0.1%
5991
 
< 0.1%
26381
 
< 0.1%
5911
 
< 0.1%
5931
 
< 0.1%
26421
 
< 0.1%
5951
 
< 0.1%
26441
 
< 0.1%
5971
 
< 0.1%
26461
 
< 0.1%
Other values (2765)2765
99.6%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
ValueCountFrequency (%)
56941
< 0.1%
56841
< 0.1%
56781
< 0.1%
56531
< 0.1%
56471
< 0.1%

customer_id
Real number (ℝ≥0)

Distinct2767
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15277.96396
Minimum12347
Maximum18287
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:14.522445image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum12347
5-th percentile12614.4
Q113804.5
median15240
Q316779.5
95-th percentile17950.3
Maximum18287
Range5940
Interquartile range (IQR)2975

Descriptive statistics

Standard deviation1721.261834
Coefficient of variation (CV)0.112663038
Kurtosis-1.211386655
Mean15277.96396
Median Absolute Deviation (MAD)1489
Skewness0.01595493513
Sum42396350
Variance2962742.301
MonotonicityNot monotonic
2021-05-25T11:13:14.667122image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
123702
 
0.1%
124312
 
0.1%
124572
 
0.1%
123942
 
0.1%
124222
 
0.1%
124172
 
0.1%
124552
 
0.1%
124292
 
0.1%
163841
 
< 0.1%
150021
 
< 0.1%
Other values (2757)2757
99.4%
ValueCountFrequency (%)
123471
< 0.1%
123481
< 0.1%
123521
< 0.1%
123561
< 0.1%
123581
< 0.1%
ValueCountFrequency (%)
182871
< 0.1%
182831
< 0.1%
182821
< 0.1%
182731
< 0.1%
182721
< 0.1%

gross_revenue
Real number (ℝ≥0)

HIGH CORRELATION

Distinct2762
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2845.153449
Minimum36.56
Maximum279138.02
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:14.817298image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum36.56
5-th percentile264.566
Q1629.045
median1169.94
Q32427.565
95-th percentile7395.585
Maximum279138.02
Range279101.46
Interquartile range (IQR)1798.52

Descriptive statistics

Standard deviation10462.81725
Coefficient of variation (CV)3.677417558
Kurtosis373.090023
Mean2845.153449
Median Absolute Deviation (MAD)691.16
Skewness17.1046029
Sum7895300.82
Variance109470544.8
MonotonicityNot monotonic
2021-05-25T11:13:14.965076image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3226.12
 
0.1%
1080.482
 
0.1%
2339.862
 
0.1%
3425.692
 
0.1%
3312
 
0.1%
734.942
 
0.1%
6382.452
 
0.1%
745.062
 
0.1%
379.652
 
0.1%
2043.232
 
0.1%
Other values (2752)2755
99.3%
ValueCountFrequency (%)
36.561
< 0.1%
521
< 0.1%
52.21
< 0.1%
62.431
< 0.1%
68.841
< 0.1%
ValueCountFrequency (%)
279138.021
< 0.1%
259657.31
< 0.1%
194550.791
< 0.1%
140450.721
< 0.1%
124564.531
< 0.1%

recency_days
Real number (ℝ≥0)

ZEROS

Distinct252
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56.57369369
Minimum0
Maximum372
Zeros33
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:15.114690image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q110
median29
Q373
95-th percentile211
Maximum372
Range372
Interquartile range (IQR)63

Descriptive statistics

Standard deviation68.34408861
Coefficient of variation (CV)1.208054206
Kurtosis3.456145254
Mean56.57369369
Median Absolute Deviation (MAD)23
Skewness1.902657391
Sum156992
Variance4670.914448
MonotonicityNot monotonic
2021-05-25T11:13:15.252415image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
199
 
3.6%
486
 
3.1%
386
 
3.1%
285
 
3.1%
876
 
2.7%
1067
 
2.4%
967
 
2.4%
765
 
2.3%
1762
 
2.2%
2255
 
2.0%
Other values (242)2027
73.0%
ValueCountFrequency (%)
033
 
1.2%
199
3.6%
285
3.1%
386
3.1%
486
3.1%
ValueCountFrequency (%)
3721
 
< 0.1%
3661
 
< 0.1%
3601
 
< 0.1%
3583
0.1%
3541
 
< 0.1%

qtde_invoices
Real number (ℝ≥0)

Distinct55
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.05981982
Minimum2
Maximum206
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:15.389095image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q12
median4
Q36
95-th percentile17
Maximum206
Range204
Interquartile range (IQR)4

Descriptive statistics

Standard deviation9.070779717
Coefficient of variation (CV)1.496872842
Kurtosis183.9137519
Mean6.05981982
Median Absolute Deviation (MAD)2
Skewness10.62190999
Sum16816
Variance82.27904467
MonotonicityNot monotonic
2021-05-25T11:13:15.526544image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2778
28.0%
3498
17.9%
4394
14.2%
5238
 
8.6%
6174
 
6.3%
7138
 
5.0%
897
 
3.5%
970
 
2.5%
1055
 
2.0%
1154
 
1.9%
Other values (45)279
 
10.1%
ValueCountFrequency (%)
2778
28.0%
3498
17.9%
4394
14.2%
5238
 
8.6%
6174
 
6.3%
ValueCountFrequency (%)
2061
< 0.1%
1991
< 0.1%
1241
< 0.1%
971
< 0.1%
912
0.1%

qtde_items
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1634
Distinct (%)58.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1672.284685
Minimum2
Maximum196844
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:15.690387image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile119.7
Q1330.5
median707
Q31485.5
95-th percentile4610.5
Maximum196844
Range196842
Interquartile range (IQR)1155

Descriptive statistics

Standard deviation5888.405154
Coefficient of variation (CV)3.521173882
Kurtosis485.8647154
Mean1672.284685
Median Absolute Deviation (MAD)455
Skewness18.18482489
Sum4640590
Variance34673315.26
MonotonicityNot monotonic
2021-05-25T11:13:15.823424image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31011
 
0.4%
2468
 
0.3%
1508
 
0.3%
4937
 
0.3%
3007
 
0.3%
2727
 
0.3%
2197
 
0.3%
3947
 
0.3%
5167
 
0.3%
12007
 
0.3%
Other values (1624)2699
97.3%
ValueCountFrequency (%)
21
< 0.1%
161
< 0.1%
171
< 0.1%
191
< 0.1%
201
< 0.1%
ValueCountFrequency (%)
1968441
< 0.1%
802631
< 0.1%
773731
< 0.1%
699931
< 0.1%
645491
< 0.1%

qtde_products
Real number (ℝ≥0)

Distinct466
Distinct (%)16.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.7556757
Minimum2
Maximum7838
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:15.963335image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile10
Q134
median72
Q3143
95-th percentile399.9
Maximum7838
Range7836
Interquartile range (IQR)109

Descriptive statistics

Standard deviation277.7016109
Coefficient of variation (CV)2.140188546
Kurtosis337.1075724
Mean129.7556757
Median Absolute Deviation (MAD)45
Skewness15.35671564
Sum360072
Variance77118.1847
MonotonicityNot monotonic
2021-05-25T11:13:16.097043image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2838
 
1.4%
3534
 
1.2%
2630
 
1.1%
2930
 
1.1%
2730
 
1.1%
2528
 
1.0%
3127
 
1.0%
1527
 
1.0%
1927
 
1.0%
3326
 
0.9%
Other values (456)2478
89.3%
ValueCountFrequency (%)
211
0.4%
312
0.4%
416
0.6%
516
0.6%
624
0.9%
ValueCountFrequency (%)
78381
< 0.1%
56731
< 0.1%
50951
< 0.1%
45801
< 0.1%
26981
< 0.1%

qtde_unique_products
Real number (ℝ≥0)

Distinct340
Distinct (%)12.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.35495495
Minimum1
Maximum1786
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:16.235301image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8
Q129
median57
Q3105
95-th percentile239.3
Maximum1786
Range1785
Interquartile range (IQR)76

Descriptive statistics

Standard deviation98.70296808
Coefficient of variation (CV)1.184128384
Kurtosis80.66815713
Mean83.35495495
Median Absolute Deviation (MAD)33
Skewness6.352949609
Sum231310
Variance9742.275908
MonotonicityNot monotonic
2021-05-25T11:13:16.361448image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3738
 
1.4%
2437
 
1.3%
2636
 
1.3%
2535
 
1.3%
3334
 
1.2%
2834
 
1.2%
3032
 
1.2%
1832
 
1.2%
1530
 
1.1%
5229
 
1.0%
Other values (330)2438
87.9%
ValueCountFrequency (%)
119
0.7%
213
0.5%
317
0.6%
418
0.6%
523
0.8%
ValueCountFrequency (%)
17861
< 0.1%
17661
< 0.1%
13221
< 0.1%
11181
< 0.1%
8841
< 0.1%

avg_ticket
Real number (ℝ≥0)

SKEWED

Distinct2767
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.10453979
Minimum2.150588235
Maximum4453.43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:16.495892image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum2.150588235
5-th percentile4.852950497
Q112.43090098
median17.94081081
Q325.10118182
95-th percentile87.67205263
Maximum4453.43
Range4451.279412
Interquartile range (IQR)12.67028084

Descriptive statistics

Standard deviation107.5928583
Coefficient of variation (CV)3.351328474
Kurtosis1055.381089
Mean32.10453979
Median Absolute Deviation (MAD)6.342522523
Skewness27.68809767
Sum89090.09792
Variance11576.22316
MonotonicityNot monotonic
2021-05-25T11:13:16.621566image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20.636686752
 
0.1%
27.392489272
 
0.1%
17.628961752
 
0.1%
34.631016952
 
0.1%
17.988421052
 
0.1%
36.046808512
 
0.1%
43.21922
 
0.1%
27.527764712
 
0.1%
3.6328813561
 
< 0.1%
8.2921568631
 
< 0.1%
Other values (2757)2757
99.4%
ValueCountFrequency (%)
2.1505882351
< 0.1%
2.43251
< 0.1%
2.4623711341
< 0.1%
2.5112413791
< 0.1%
2.5153333331
< 0.1%
ValueCountFrequency (%)
4453.431
< 0.1%
1687.21
< 0.1%
952.98751
< 0.1%
872.131
< 0.1%
841.02144931
< 0.1%

avg_recency_days
Real number (ℝ≥0)

Distinct305
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean78.42918919
Minimum1
Maximum366
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:16.751738image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13
Q134
median59
Q399
95-th percentile224
Maximum366
Range365
Interquartile range (IQR)65

Descriptive statistics

Standard deviation66.52989591
Coefficient of variation (CV)0.8482797871
Kurtosis3.700911137
Mean78.42918919
Median Absolute Deviation (MAD)30
Skewness1.833646582
Sum217641
Variance4426.22705
MonotonicityNot monotonic
2021-05-25T11:13:16.881224image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3540
 
1.4%
7039
 
1.4%
5537
 
1.3%
3136
 
1.3%
4536
 
1.3%
2136
 
1.3%
2535
 
1.3%
2634
 
1.2%
4634
 
1.2%
3833
 
1.2%
Other values (295)2415
87.0%
ValueCountFrequency (%)
19
0.3%
25
0.2%
38
0.3%
48
0.3%
55
0.2%
ValueCountFrequency (%)
3661
< 0.1%
3651
< 0.1%
3641
< 0.1%
3631
< 0.1%
3572
0.1%

returns
Real number (ℝ≥0)

ZEROS

Distinct23
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.124684685
Minimum0
Maximum45
Zeros1483
Zeros (%)53.4%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:17.004068image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile4
Maximum45
Range45
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.375741003
Coefficient of variation (CV)2.112361834
Kurtosis106.8386356
Mean1.124684685
Median Absolute Deviation (MAD)0
Skewness7.759467296
Sum3121
Variance5.644145313
MonotonicityNot monotonic
2021-05-25T11:13:17.110194image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
01483
53.4%
1656
23.6%
2270
 
9.7%
3139
 
5.0%
492
 
3.3%
538
 
1.4%
632
 
1.2%
721
 
0.8%
98
 
0.3%
125
 
0.2%
Other values (13)31
 
1.1%
ValueCountFrequency (%)
01483
53.4%
1656
23.6%
2270
 
9.7%
3139
 
5.0%
492
 
3.3%
ValueCountFrequency (%)
451
< 0.1%
441
< 0.1%
351
< 0.1%
271
< 0.1%
211
< 0.1%

latitude
Real number (ℝ)

Distinct28
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.48388445
Minimum-25.274398
Maximum64.963051
Zeros0
Zeros (%)0.0%
Negative8
Negative (%)0.3%
Memory size21.8 KiB
2021-05-25T11:13:17.221454image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum-25.274398
5-th percentile50.503887
Q155.378051
median55.378051
Q355.378051
95-th percentile55.378051
Maximum64.963051
Range90.237449
Interquartile range (IQR)0

Descriptive statistics

Standard deviation5.18198778
Coefficient of variation (CV)0.09511046858
Kurtosis164.9603122
Mean54.48388445
Median Absolute Deviation (MAD)0
Skewness-11.49946557
Sum151192.7794
Variance26.85299735
MonotonicityNot monotonic
2021-05-25T11:13:17.324466image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
55.3780512515
90.6%
51.16569167
 
2.4%
46.22763858
 
2.1%
40.46366720
 
0.7%
50.50388719
 
0.7%
46.81818812
 
0.4%
39.39987211
 
0.4%
-25.2743988
 
0.3%
56.263927
 
0.3%
61.924117
 
0.3%
Other values (18)51
 
1.8%
ValueCountFrequency (%)
-25.2743988
0.3%
1.3520831
 
< 0.1%
31.0460511
 
< 0.1%
35.1264134
0.1%
35.9374961
 
< 0.1%
ValueCountFrequency (%)
64.9630511
 
< 0.1%
61.924117
0.3%
60.4720246
0.2%
60.1281614
0.1%
56.263927
0.3%

longitude
Real number (ℝ)

Distinct28
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-1.911966504
Minimum-106.346771
Maximum138.252924
Zeros0
Zeros (%)0.0%
Negative2552
Negative (%)92.0%
Memory size21.8 KiB
2021-05-25T11:13:17.435636image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum-106.346771
5-th percentile-3.435973
Q1-3.435973
median-3.435973
Q3-3.435973
95-th percentile8.227512
Maximum138.252924
Range244.599695
Interquartile range (IQR)0

Descriptive statistics

Standard deviation10.70437596
Coefficient of variation (CV)-5.598621071
Kurtosis134.0336741
Mean-1.911966504
Median Absolute Deviation (MAD)0
Skewness9.896905264
Sum-5305.707049
Variance114.5836646
MonotonicityNot monotonic
2021-05-25T11:13:17.567272image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
-3.4359732515
90.6%
10.45152667
 
2.4%
2.21374958
 
2.1%
-3.7492220
 
0.7%
4.46993619
 
0.7%
8.22751212
 
0.4%
-8.22445411
 
0.4%
133.7751368
 
0.3%
9.5017857
 
0.3%
25.7481517
 
0.3%
Other values (18)51
 
1.8%
ValueCountFrequency (%)
-106.3467711
 
< 0.1%
-95.7128911
 
< 0.1%
-19.0208351
 
< 0.1%
-8.243893
 
0.1%
-8.22445411
0.4%
ValueCountFrequency (%)
138.2529245
0.2%
133.7751368
0.3%
103.8198361
 
< 0.1%
34.8516121
 
< 0.1%
33.4298594
0.1%

frequency_purchase
Real number (ℝ≥0)

SKEWED

Distinct1228
Distinct (%)44.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.06181163036
Minimum0.005464480874
Maximum34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:17.707137image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum0.005464480874
5-th percentile0.008771929825
Q10.01587301587
median0.02452316076
Q30.04215713428
95-th percentile0.1179152512
Maximum34
Range33.99453552
Interquartile range (IQR)0.02628411841

Descriptive statistics

Standard deviation0.6693187857
Coefficient of variation (CV)10.82836324
Kurtosis2387.960895
Mean0.06181163036
Median Absolute Deviation (MAD)0.01078689703
Skewness47.42232667
Sum171.5272742
Variance0.4479876369
MonotonicityNot monotonic
2021-05-25T11:13:17.851628image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0714285714316
 
0.6%
0.0476190476215
 
0.5%
0.0158730158714
 
0.5%
0.030303030314
 
0.5%
0.0285714285714
 
0.5%
0.0238095238113
 
0.5%
0.0645161290313
 
0.5%
0.142857142913
 
0.5%
0.02512
 
0.4%
0.117647058812
 
0.4%
Other values (1218)2639
95.1%
ValueCountFrequency (%)
0.0054644808741
< 0.1%
0.0054794520551
< 0.1%
0.0054945054951
< 0.1%
0.0055096418731
< 0.1%
0.0056022408962
0.1%
ValueCountFrequency (%)
341
 
< 0.1%
61
 
< 0.1%
41
 
< 0.1%
26
0.2%
1.51
 
< 0.1%

avg_basket_size
Real number (ℝ≥0)

Distinct974
Distinct (%)35.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.48464482
Minimum1
Maximum297.8823529
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-05-25T11:13:18.011793image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.5
Q110.04545455
median17
Q327.33333333
95-th percentile54.215
Maximum297.8823529
Range296.8823529
Interquartile range (IQR)17.28787879

Descriptive statistics

Standard deviation17.84655749
Coefficient of variation (CV)0.8306656982
Kurtosis27.52701801
Mean21.48464482
Median Absolute Deviation (MAD)8
Skewness3.258241693
Sum59619.88938
Variance318.4996143
MonotonicityNot monotonic
2021-05-25T11:13:18.167475image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1345
 
1.6%
1730
 
1.1%
1130
 
1.1%
1428
 
1.0%
126
 
0.9%
7.525
 
0.9%
1525
 
0.9%
925
 
0.9%
2324
 
0.9%
9.524
 
0.9%
Other values (964)2493
89.8%
ValueCountFrequency (%)
126
0.9%
1.21
 
< 0.1%
1.251
 
< 0.1%
1.3333333332
 
0.1%
1.58
 
0.3%
ValueCountFrequency (%)
297.88235291
< 0.1%
1911
< 0.1%
135.33333331
< 0.1%
129.751
< 0.1%
125.751
< 0.1%

Interactions

2021-05-25T11:12:48.605796image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:48.752331image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:48.871144image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:48.992069image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.116180image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.241040image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.364278image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.490011image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.607279image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.755100image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:49.887300image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.023063image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.173623image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.316324image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.457560image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.602406image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.742281image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:50.897120image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.061815image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.190544image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.316882image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.436684image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.545329image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.663480image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.767449image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.869420image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:51.972748image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.084965image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.190087image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.291316image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.400406image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.520286image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.643295image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.769544image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:52.893350image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.014545image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.130864image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.250621image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.363457image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.475129image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.587986image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.708809image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.827307image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:53.970088image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.108796image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.229167image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.385258image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.541339image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.712295image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.865048image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:54.974544image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.089943image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.194327image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.298289image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.406543image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.516240image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.626470image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.736058image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.845363image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:55.958046image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.095456image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.257588image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.385981image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.503081image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.626301image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.750823image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.857939image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:56.964423image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.078945image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.196653image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.311076image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.433354image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.570393image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.707312image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.839212image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:57.975901image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.115737image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.254445image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.381160image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.523974image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.648452image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.773763image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:58.902456image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.034020image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.160232image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.275856image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.389108image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.500803image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.619626image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.735730image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.856471image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:12:59.973094image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.085186image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.200137image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.314160image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.427080image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.539202image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.664615image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.781788image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:00.894496image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.006240image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.118921image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.230620image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.349119image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.467565image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.585252image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.691531image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.803543image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:01.907386image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.009446image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.135504image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.268842image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.383560image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.496547image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.612522image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.726853image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.840061image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:02.960238image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.081573image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.206401image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.322083image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.438551image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.544767image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.664345image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.774772image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:03.890164image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.004857image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.130286image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.255048image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.380981image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.508016image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.637017image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.764539image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.884320image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:04.999019image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.109550image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.214583image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.320354image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.429108image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.542695image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.655846image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.753920image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.854473image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:05.951653image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.052831image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.159314image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.264431image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.374510image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.475818image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.578948image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.683763image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.781643image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.878424image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:06.979460image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.083050image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.181155image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.280063image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.378308image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.493752image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.610556image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.730250image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.847340image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:07.963805image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.074055image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.191181image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.295628image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.408070image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.523703image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.639185image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.752128image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.867752image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:08.982945image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.100906image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.228716image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.351502image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.473385image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.597032image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.713892image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.832190image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:09.943158image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.051794image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.161144image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.266238image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.376198image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.497176image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.619885image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.750745image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.873019image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:10.993773image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.110130image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.223821image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.326953image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.439059image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.542773image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.651985image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.764256image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.875639image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:11.978969image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.084784image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.192136image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.300408image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.414883image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.542981image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.679739image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.800382image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:12.915965image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:13.040850image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:13.153394image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:13.272880image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
2021-05-25T11:13:13.389562image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Correlations

2021-05-25T11:13:18.315823image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-05-25T11:13:18.538996image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-05-25T11:13:18.776426image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-05-25T11:13:19.001061image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2021-05-25T11:13:13.642607image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
A simple visualization of nullity by column.
2021-05-25T11:13:13.960173image/svg+xmlMatplotlib v3.4.1, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

df_indexcustomer_idgross_revenuerecency_daysqtde_invoicesqtde_itemsqtde_productsqtde_unique_productsavg_ticketavg_recency_daysreturnslatitudelongitudefrequency_purchaseavg_basket_size
00178505391.21372.034.01733.0297.021.018.1522221.01.055.378051-3.43597334.0000008.735294
11130473232.5956.09.01390.0171.0105.018.90403552.07.055.378051-3.4359730.02839119.000000
22125836705.382.015.05028.0232.0114.028.90250026.02.046.2276382.2137490.04043115.466667
3313748948.2595.05.0439.028.024.033.86607192.00.055.378051-3.4359730.0179865.600000
4415100876.00333.03.080.03.01.0292.00000020.03.055.378051-3.4359730.0750001.000000
55152914623.3025.014.02102.0102.061.045.32647126.05.055.378051-3.4359730.0402307.285714
66146885630.877.021.03621.0327.0148.017.21978619.06.055.378051-3.4359730.05737715.285714
77178095411.9116.012.02057.061.046.088.71983639.02.055.378051-3.4359730.0336135.083333
881531160767.900.091.038194.02379.0567.025.5434644.027.055.378051-3.4359730.24396825.901099
99160982005.6387.07.0613.067.034.029.93477647.00.055.378051-3.4359730.0244769.428571

Last rows

df_indexcustomer_idgross_revenuerecency_daysqtde_invoicesqtde_itemsqtde_productsqtde_unique_productsavg_ticketavg_recency_daysreturnslatitudelongitudefrequency_purchaseavg_basket_size
2765560917290525.243.02.0404.0102.092.05.14941213.00.055.378051-3.4359730.15384647.000000
276656181478577.4010.02.084.03.02.025.8000005.00.055.378051-3.4359730.4000001.500000
2767561917254272.444.02.0252.0112.0100.02.43250011.00.055.378051-3.4359730.18181855.500000
2768563517232421.522.02.0203.036.030.011.70888912.00.055.378051-3.4359730.16666717.500000
2769563617468137.0010.02.0116.05.05.027.4000004.00.055.378051-3.4359730.5000002.500000
2770564713596697.045.02.0406.0166.0133.04.1990367.00.055.378051-3.4359730.28571469.000000
27715653148931237.859.02.0799.073.072.016.9568492.00.055.378051-3.4359731.00000036.500000
2772567814126706.137.03.0508.015.014.047.0753333.01.055.378051-3.4359731.0000005.000000
27735684135211092.391.03.0733.0435.0312.02.5112414.00.055.378051-3.4359730.333333135.333333
2774569415060301.848.04.0262.0120.080.02.5153331.00.055.378051-3.4359734.00000027.500000